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( 1 ) GENERAL INFORMATION: 

(Mi) NUMBER OF SEQUENCES: H 



( 2 ) INFORMATION FOR SEQ ED Kttl: 

{ i ) SEQUENCE CHARACTERISTTCS: 
(A ) LENGTH: 24 base* 
( B )TYPE; midck tad 
( C > STRANDEDNESS: smgfe 
(D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (psomic) 

< x i ) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

TGCCCAGCTC CTOOCCCCCC GCTT 



( 2 ) INFORMATION FOR SEQ ID NCfcZ 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 hues 
( B ) TYPE: nsddc tcid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

< i i ) MOLECULE TYPE: DNA (genomic) 

( * i ) SEQUENCE DESCRIPTION: SEQ ID NOifc 

CTGCATCAAC AC AGGCGCCT CTTC 



( 2 ) INFORMATION FOR SEQ ID NO J: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 27 base* 
( B )TYPE: nucleic end 
( C ) STRANDEDNESS: sbglc 
( D ) TOPOLOGY; Ibex 

( i i > MOLECULE TYPE: DNA (genomic) 

- < x i ) SEQUENCE DESCRIPTION: SEQ ID NOJ: 

rTCAAATGAG A T TOTOGGAA AATTGC 



( 2 ) INFORMATION FOR SEQ ID NO:4 : 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 bases 
( B ) TYPE: nudcic acid 
( C ) STRANDEDNESS; tingk 
(D ) TOPOLOGY: linear 



# # 



19 

( i i ) MOLECULE TYPE: DNA (genomic) 
( x i ) SEQUENCE DESCRIPTION: 5EQ ID NO:* 
AGATCATCTC TOCCTC AGT A TCTT 

( 2 ) INFORMATION FOR SEQ ID KW: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 base* 
( B )TYP£nocfcfcaad 
( C ) STRANDEDNESS: aingle 
( D ) TOPOLOGY: Iineir 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO*. 

CCACCCATGG CAAATTCCAT OGCA 
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-continued 



( 2 ) INFORMATION FOR SEQ ID NOri: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH 24 bases 
( B ) TYPE: oneiric irid 
{ C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (generate) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO& 

TCTAGACGOC AGGTCAOGTC CACC 



( 2 ) INFORMATION FOR SEQ Q> NO:7: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 12 amino acids 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:?: 

Asp Aip lie A * n Pid T b r Vsl Lcd Leo Lys Glu A r g 
I 5 10 



( 2 ) INFORMATION FOR SEQ ID NO* : 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 bases 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
(■ D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE DNA (genomic) 

( % i ) SEOUENCE DESCRIPTION: SEQ ID NOS: 



( 2 ) INFORMATION FOR SEQ ID NOS: 

( i ) SEQUENCE CHARACTERISTICS: 
• ( A ) LENGTH: 24 bases 
( B ) TYPE: nucleic xAA 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: Bsear 



( i i 



) MOLECULE TYPE: DNA (s^nnraic) 

) SEQUENCE DESCRIPTION: SEQ ED NO*: 
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-continued 



CTTCTACAGT TCAOTCOAAC OTTC 



( 2 ) INFORMATION FOR SEQ ID Mfclft 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 604 amino adds 
( B )TYPE: amino acal 
( C ) STRANDEDNE55: single 
( D ) TOPOLOGY: Kocar 

( i i ) MOLECULE TYPE: protein 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOiKh 



Met Leo Ala Arg Al 

1 5 

Tb r All Aid pro C y 
2 0 



Mo i Scr Val Gly P o 
35 

Ph c Ty t. G I y Q 1 v As 
5 0 

Lea Phc Leo Ly * Pr 
6 5 

Phc lyi Gly Phc Tr 
8 5 

All lie Met Scr Ty 
I 0 0 

Pro Pro Thr Tyr As 
1 1 5 

Ser Asa Leo Scr Ty 

1 3 0 

Cy a Pro Thr Pro Le 
1 4 5 

Aid C 1 o lie Val Cl 

1 6 

Pro Otn Gly Scr Ai 

1 8 0 

His Oln.Pbe Phc Ly 

1 9 5 

Gly Leo Gly His Gl 

2 10 

Al* Arg Gin Are Ly 

225 

G 1 o lie lie Aip Gl 

2 4 

Ala Gin Met lie Ty 

2 6 0 

Val Gly Gla Glo V a 

2 7 3 

Thr lie Tip Leo Ar 
2 9 0 

Gl tt Hit Pro Glo Tr 

305 

lie Leo lie Gly Gl 

3 2 



Leo Lea Leo Cy s 



C y i Scr 
A i p Glo 



C y * Set 

5 5 

Thr Pro 
7 0 



H i * 



Tyr 
40 



Ala Asp 



Tyr T b r 
1 3 5 

Gly Val 
1 5 0 



Thr Asp 



Val Asp 
2 1 5 

Leo Ate 
2 3 0 



Phc Gly 



Glo His 
2 9 5 

Gly Asp 

3 1 0 

Thr lie 



Loo 
2 8 0 



Hi i Leo Ser Gly Tyr His Phc 



Pro 
2 5 

Ly s 



A 1 a 

1 0 

Cy l 
C y s 



Val Leo Ala Leo Sor His 



Gla Aso Arc 



Asp Cy s 



Th r 
4 5 



Thr Pro Glo Phc 



Leo 
60 



oi y 

3 0 
Arg 

At g 



Asa Thr Va 



Hi s 

7 5 



Asn Val Va! Aao 



A s o 
9 0 



Val Loo Thr 



Ty r 
1 2 0 



Ser 
1 0 5 

Cl y 



Arg Ser His Loo 
Ty r 



Ly i Ser 



Tr p 
I 2 5 



Ar | Ala Leo Pro 



P r o 
1 4 0 



Lyi Gly Lys 



Ly s 
1 5 5 



Gin Leo Pro 



Lys Lob Leo Leo 



Ar g 
1 7 0 



Met Met Phc 



A 1 a 

1 8 5 



Pbe Phc Ala Gla 



H i a 

2 00 



Lys Arg Gly Pro 



Leo Asn Hit lie 



Lcn Pbe Lys 



Asp 

2 3 5 



Tyr 
2 2 0 

Gly 



GloMct Tyr Pro 



Pro 
2 5 0 



Thr Val 



A 1 a 

2 0 5 

Gly 
Ly * 
Ly * 



Pro Pro Gin 



Va 1 
2 6 5 



Pro Glo Hit Leo 



Val Pro 



G 1 y 

Aid Arg Val Cys 



Lea 



Asp 
3 0 0 



Mc t 
2 8 5 



Val Lei 



Glo Gin Lcn 



P h c 
3 1 5 



Glo Thr Ser 



o r 
1 3 



Va 1 



Tyr lie Leo Thr 



lie Pro Phc Leo 



Arg 

9 5 



Cyt 
Oly 
Lys 



H i s 
8 0 



Asn 



I 1 e 
1 I 0 



Asp Ser 



Glo Ala Phc 



Val Pro 



Arg Lys Phc Mc 



Asp 
Asp 



Pro 
1 7 5 



Asp 



Ser 
1 6 0 



Aip 



Hi » 
1 9 0 



Phc >Th r 



Pbe Thr Asn 



Glo Tbr Leo 



Met Lys 



Asp 



Arg 

27 0 



Thr 
25 3 



Tyr 
2 4 0 

G 1 n 



Phc Ala 



Tyr 
Lys 
Arg 



Al a 



G 1 i 



Lea 
3 20 



V a 1 

3 3 0 



I I e Glo Asp Tyr 



G 1 o 



Ly a 

Lya Leo Lys Phc Asp Pro Gin Leo Lc 



Va I 

3 33 
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-continued 



340 345 350 

Phc Asn Lyi Gin Pbe On Tyr GIo Aid Art * lc Alt Ala Gla Pbc Asa 
355 360 365 

Tbr Loo Tyr His T r p Hi* Pro Leo Leo Pro Asp Tbr Pbe Gin lie His 
370 375 3 8 0 

Asp Gin Ly i Tyr Aid Tyr OId Gla Pbc lie Tyr Asa Asn Scr lie Lea 
3 8 5 390 395 400 

Lcn GIo His Gly lie Tbr Ola Pbc Vil Gla Scr Phc Tbr A r g Gla lie 
4 0 5 4 10 4 15 

Ala Gly Arg Val Ala Gly Gly Arg Asa Val Pro Pro Ala Val Gla Lyi 
420 425 430 

Vol Scr Gla Ala Ser lie Asp Gla Set Arg Gla Met Lyi Tyr Gla Sor 
43S 440 445 

Phc Asa Gla Tyr Arg Lyi Arg Pbe Met Lea Lyi Pro Tyr Gla Scr Pbe 
450 455 460 

Gla Gla Lea Tbr Gly Olo Lys Gla Mel Ser Alt Olo Lea Gla Ala Lea 
465 470 475 480 

Tyr Gly Asp lie Asp Ala Val Gla Lea Tyr Pro Ala Lea Lea Vsl Ola 
485 490 495 

Lys Pro Arg Pro Asp Ala lie Pbc Gly Gla Tbr Mel Vsl Gla Vsl Gly 
500 505 510 

Ala Pro Phc Scr Leo Lys Gly Lea Mel Gly Asa Val lie Cys Scr Pro 
515 520 525 

Ala Tyr Trp Lys Pro Scr Tbr Phc Gly Oly GIo Vsl Gly Pbc Gin lie 
530 535 540 

lie Ass Tbr Als Ser 11c Gla Scr Leo lie Cys Asn Asa Val Lys Gly 
545 550 555 560 

Cys Pro Phc Tbr Scr Pbc Scr Val Pro Asp Pre GIo Leo lie Lys Tbr 
565 570 575 

Vsl Tbr 11c Asn Ala Scr Ser Ser Arg Scr Gly Leu Asp Asp lie Asn 
580 585 590 

Pro Tbr Va] Leo Lcn Lys GIo Arg Scr Tbr GIo Leo 



( 2 ) INFORMATION PGR SEQ ID Nail: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 3387 bases 
( B ) TYPE: nucleic *dd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DKA (genomic) 

( % i ) SEQUENCE DESCRIPTION: SEQ ID NO:U: 

GTCCAGGAAC TCCTCAGCAG CGCCTCCTTC AGCTCCACAG CCAGACGCCC TCAGACAGCA 60 

AAGCCTACCC CCGCGCCGCG CCCTGCCCGC CGCTGCGATG CTCGCCCGCG CCCTCCTCCT 120 

GTOCOCGGTC CTGGCGCTCA GCCATACAGC AAATCCTTGC TGTTCCCACC CATGTCA AAA 180 

CCGAGGTGTA TOTATGAGTG TGGGATTTGA CCAGTATAAG TGCGATTGTA CCCGGAC AGG 240 

ATTCTATGGA GAAAACTGCT CA AC ACCGGA ATTTTTGACA AGAATAAAAT TATTTCTOAA 300 

ACCCACTCCA AACACAOTGC ACTACATACT TACCCACTTC AAGCGATTTT GGAACGTTGT 360 

GAATAACATT CCCTTCCTTC GAAATGCAAT TATGAGTTAT GTGTTOACAT CCAGATCACA 420 

TTTGATTGAC AGT CCA CCA A CTTACAATGC TGACTATOGC TACAAAAGCT GOGAAGCCTT 480 

CTCTAACCTC TCCTATTATA CTAGAGCCCT TCCTCCTGTG CCTGATGATT GCCCGACTCC 540 
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-continued 

CTTOGOTOTC AAAOOTAAAA AGCAOCTTCC TGATTCAAAT OAOATTOTGO AAAAATTGCT 600 

TCTAAGAAGA A AGTTCATCC CTCATCCCCA GGGCTCAAAC ATGATGTTTG CATTCTTTGC 660 

CCAGCACTTC ACOCACCACT TTTTCAAGAC AG A T C A T A AG COAGGGCCAG CTTTCACCAA 720 

CGOGC TGGGC CATGOOOTGO A CTT A A AT C A TATTTACGGT GAAACTCTGG CTAGACAGCG 7 80 

T A AACTGCGC CTTTTC A AOG ATGGAAAAAT GAAATATCAG ATAATTGATG GAGAGATGTA 840 

TCCTCCCACA GTCAAAOATA CTCAGGCAGA GATGATCTAC CCTCCTCAAG TCCCTOAGCA 900 

TCT ACGOTTT GCTGTGGGOC AGGAGOTCTT TOOTCTGGTG CCTGOTCTGA TGATGTATGC 960 

CACAATCTGG CTGCGGGAAC ACAACAGAOT ATGTG ATGTG CTTAAACAGG AGCATCCTGA 1020 

ATGGGGTG AT GAGCAGTTGT TCCAGACAAG CAGGCTAATA CTGATAGGAG AGACTATTAA 1080 

GATTGTGATT GAAGATTATG TGCAACACTT G AG TGGCT A T CACTTCAAAC TG A A ATTTGA 1140 

CCC AG A ACT A CTTTTCAACA AACAATTCCA OTACCAAAAT CGTATTGCTG CTGAATTTAA 1200 

CACCCTCTAT CACTGGCATC CCCTTCTOCC TGAC ACCTTT CAAATTCATG ACCAGA AATA 1260 

CAACTATCAA CAGTTTATCT A C A AC A A CTC TAT A TTGCTG G A AC AT G G A A TTACCCAGTT 1320 

TGTTG A AT CA TTCACCAGGC AAATTCCTGG CAGGGTTGCT GGTGGT AGG A ATGTTCC ACC 1310 

CGCAGTAC AG AAAOTATCAC AGGCTTCCAT TGAC C AGAGC AGGCACATGA AATACCAGTC 1440 

TTTTAATCAG TACCGCAAAC GCTTTATGCT GAAGCCCTAT GAATCATTTG A AG A ACT T AC 1500 

AGG AG AAA AG GAAATGTCTG CAGAGTTGGA AGCACTCTAT GGTOACATCG ATGCTGTGGA 1560 

GCTGTATCCT GCCCTTCTGG TAG A A AAGCC TCGGCCAOAT GCCATCTTTG OTOAAACCAT 1620 

GGTAGAAGTT GGAGCACCAT TCTCCTTCAA AGGACTTATG CGTAATGTTA TATGTTCTCC 1680 

TG CCTACTGG AAGCCAAGCA CTTTTGOTGG AGAAGTGGGT TTTCA AATCA TCAACACTGC 1740 

CTCAATTCAG TCTCTCATCT GCAATAACGT GAAGGGCTGT CCCTTTACTT CATTCAGTGT 1800 

TCCAGATCCA GAGCTCATTA AA AC AGTCAC CATCAATGCA ACTTCTTCCC GCTCCGG ACT 1860 

AOATGATATC AATCCCACAG TACTACTAAA AOAACGGTCO ACTGA ACTGT A G A AGTCTA A 1920 

TGATCATATT TATTTATTTA TATGAACCAT GTCTATTAAT TTAATTATTT AATAATATTT 1980 

ATATTAAACT CCTTATGTTA CTTA ACATCT TCTGTAACAG AAGTC AGT AC TCCTGTTGCG 2040 

GAGAAAGGAG TCATACTTGT GAAGACTTTT ATGTCACTAC TCTAAAGATT TTOCTGTTOC 2100 

TGTTAAGTTT GG A A A ACAGT TTTTATTCTG TTTT ATAAAC CAG AGAG AAA TGAGTTTTG A 2160 

CGTCTTTTTA CTTGAATTTC AACTTATATT ATAAGGACGA AAGTAAAGAT GTTTG AATAC 2220 

TTAAACACTA TCACAAGATG CCAAAATGCT GAAAGTTTTT ACACTGTCG A TGTTTCCAAT 2280 

GCATCTTCCA TGATGCATTA GAAGTAACTA ATGTTTGA A A TTTTAAAGTA C TTTTGGGT A 2 3 4 0 

TTTTTCTGTC A T C A A A C A A A ACAGGTATC A GTGCATTATT AAA TG A A TAT TTAAATTAGA 2 4 0 0 

CATTACCAGT AATTTCATGT CTACTTTTTA AAATCAGCAA TGAAAC A ATA ATTTGAAATT 2460 

TCTAAATTCA T AGGGT AG A A TC ACCTOT A A A AGC TTGTTT G ATTTCTTA A AGTTATTAAA 2 5 2 0 

CTTGTACATA T A C C A A A A A G AAGCTGTCTT GGATTTAAAT CTGTAAAATC AGATGAAATT 2 5 8 0 

TTACTACAAT TGCTTOTT A A AATATTTTAT AAOTGATOTT CCTTTTTCAC CAAOAGTATA 2 6 4 0 

AACCTTTTTA GTGTGACTGT TAAAACTTCC TTTT A A A TC A A A ATGCC A A A TTTATTAAGG 2 7 00 

TGGTGGAGCC ACTGC AGTGT T ATCTC A A A A TAAGAATATC CTGTTG AG A T ATTCCAOA AT 2 7 6 0 

CTGTTTATAT OOCTGGTAAC ATGT A A A A AC CCCATAACCC CGCCAAAAGG OGTCCTACCC 2820 

TTGAACATAA AGCAATAACC A A AGG A G AA A AGCCCAAATT ATTOGT TCC A A ATTT AGOOT 2 8 8 0 

TTAAACTTTT TGAAGCAAAC TTTTTTTTAG CC TTGTGC AC TGC AG A CCT G GTACTCAG AT 2 9 40 
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TTTGCTATO A OGTTAATGAA GTACCAAGCT GTGCTTGAAT AACGAT A TOT TTTCTCAGAT 3 00 0 

TTTCTGTTGT ACAGTTTAAT TTAGCAGTCC AT ATCAC ATT OCAA A AG T AG CAATOACCTC 3 06 0 

ATAAAATACC TCTTC AAAAT GCTT AAATTC ATTTCACACA TTAATTTTAT CTCAGTCTTG 3 120 

AAGCC AATTC AGTAGGTGCA TTGGAATCAA GCCTOGCTAC CTOCATGCTG TTCCTTTTCT 3 180 

TTTCTT CTT T TAGCCATTTT GCTAAGAGAC ACAGTCTTCT CAAACACTTC OTTTCTCCTA 3 24 0 

TTTTGT TTT A CT AG TTTTAA GATCAGAGTT CACTTTCTTT CGACTCTGCC T AT ATT TT CT 3 3 0 0 

TACCTGAACT TTTGCAAOTT TTC AGGT A A A CCTCAGCTCA OGACTGCT A T TT A GCTCCTC 3 3 6 0 

TTAAGAAGAT TAAAAAAAAA AAAAAAG 338? 

( 2 ) INFORMATION FOR SEQ ID NO:12: 

( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 buct 
( B ) TYPE: nucleic icid 
( C ) STRANDEDNESS: sintic 
( D ) TOPOLOGY: Hocsr 

( i i )M0LECUUE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CCTTCCTTCG AAATGCAATT A 21 

( 2 ) INFORMATION FOR SEQ ID NO:13: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 bases 
< B ) TYPE; noefck tad 
( C ) STRANDED NESS: stogie 
( D) TOPOLOGY: fetor 

( i i ) MOLECULE TYPE: DNA (feoomk) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AAACTGATGC GTGA AGTGCT G 2 1 

( 2 ) INFORMATION FOR SEQ ID Nth 14: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 Vases 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: lioglc 
( D ) TOPOLOGY: Qncar 

( i i ) MOLECULE TYPE: DNA (genomic) 

( % 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

GAGATTGTGO OA A A ATTGCT T 2 1 



